Narrow Phonetic Transcription for Development of a Large Vocabulary Isolated Word Recognizer

نویسندگان

  • Lex Olorenshaw
  • Mariscela Amador
  • Ruxin Chen
  • Xavier Menendez-Pidal
چکیده

To build a very large vocabulary (50K) isolated word speech recognizer, speech data from over 200 native speakers of American English was recorded and manually transcribed. This paper explains the transcription method used, the motivation, and preliminary results implemented in the recognizer. The symbol set was expanded to allow for narrow phonetic transcriptions, similar to the level of detail allowed by the International Phonetic Alphabet (IPA). By expanding the symbol set, the following was expected: n more accurate symbolic representation of the acoustic material; n greater consistency between transcribers; n greater flexibility in the use of the phonetic transcriptions during training and testing of the speech recognizer; n greater insight into the acoustic-phonetic variables that affect the performance of the speech recognizer; n over all, to build a better recognizer.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Microprocessor implementation of a large vocabulary speech recognizer and phonetic typewriter for Finnish and Japanese

A flexible and inexpensive real-time speech recognitlon system is described. It operates in the following modes: recognitlon of isolated words from a large vocabulary, and orthographic transcription of (eventually continuous) speech. The main parts are the acoustic processor module that transcribes speech into phonemes, a large-vocabulary lexical-access module that recognizes isolated words on ...

متن کامل

A hybrid word / phoneme-based approach for improved vocabulary-independent search in spontaneous speech

For efficient organization of speech recordings – meetings, interviews, voice mails, and lectures – being able to search for spoken keywords is essential. Today, most spoken document retrieval systems use large-vocabulary recognition. For the above scenarios, such systems suffer from the unpredictable domain, out-ofvocabulary queries, and generally high word-error rate (WER). In [1], we present...

متن کامل

A bi-lingual Mandarin/taiwanese (min-nan), large vocabulary, continuous speech recognition system based on the tong-yong phonetic alphabet (TYPA)

In this paper, we describe the first Mandarin/Taiwanese (Min-nan) bi-lingual, continuous speech recognition system for large vocabulary or vocabulary-independent applications. A phonetic transcription system called Tong-yong Phonetic Alphabet (TYPA) is described and used to transcribe the bilingual Mandarin/Taiwanese lexicons. The Right-ContextDependent (RCD) phonetic continuous-density Hidden ...

متن کامل

Phonetic Distance Measures for Speech Recognition Vocabulary and Grammar Optimization

This paper reports on the correlation between word confusion matrices from Word-Error-Rate (WER) experiments and different phonetic distance measures. The investigated phonetic distance measures are based on the minimum-edit-distances between phonetic transcriptions and the distances between Hidden-Markov-Models (HMM). We show that phonetic distance measures are correlated with word confusion. ...

متن کامل

Combination of Multiple Speech Transcription Methods for Vocabulary Independent Search

Today, most systems use large vocabulary continuous speech recognition tools to produce word transcripts which have indexed transcripts and query terms retrieved from the index. However, query terms that are not part of the recognizer’s vocabulary cannot be retrieved, thereby affecting the recall of the search. Such terms can be retrieved using phonetic search methods. Phonetic transcripts can ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999